Monte Carlo Structured SVI for Non-Conjugate Models
نویسندگان
چکیده
The stochastic variational inference (SVI) paradigm, which combines variational inference, natural gradients, and stochastic updates, was recently proposed for large-scale data analysis in conjugate Bayesian models and demonstrated to be effective in several problems. This paper studies a family of Bayesian latent variable models with two levels of hidden variables but without any conjugacy requirements, making several contributions in this context. The first is observing that SVI, with an improved structured variational approximation, is applicable under more general conditions than previously thought with the only requirement being that the approximating variational distribution be in the same family as the prior. The resulting approach, Monte Carlo Structured SVI (MC-SSVI), significantly extends the scope of SVI, enabling large-scale learning in non-conjugate models. The second contribution is developing the algorithmic details of MC-SSVI for two challenging models. The application of MC-SSVI to probabilistic matrix factorization (PMF) yields an algorithm which is efficient and generic in that it is applicable to any type of observation likelihood, with improvements in convergence speed and in general applicability over previous work. The application of MC-SSVI to the correlated topic model (CTM) improves over previous work which used the much stronger mean field variational approximation. An experimental evaluation demonstrates the advantages of MC-SSVI.
منابع مشابه
Gibbs Sampling for Bayesian Non-Conjugate and Hierarchical Models by Using Auxiliary Variables
We demonstrate the use of auxiliary (or latent) variables for sampling non-standard densities which arise in the context of the Bayesian analysis of non-conjugate and hierarchical models by using a Gibbs sampler. Their strategic use can result in a Gibbs sampler having easily sampled full conditionals. We propose such a procedure to simplify or speed up the Markov chain Monte Carlo algorithm. T...
متن کاملMonte Carlo characterization of photoneutrons in the radiation therapy with high energy photons: a Comparison between simplified and full Monte Carlo models
Background: The characteristics of secondary neutrons in a high energy radiation therapy room were studied using the MCNPX Monte Carlo (MC) code. Materials and Methods: Two MC models including a model with full description of head components and a simplified model used in previous studies were implemented for MC simulations. Results: Results showed 4-53% difference between full and wit...
متن کاملSequential parameter learning and filtering in structured autoregressive state-space models
We present particle-based algorithms for sequential filtering and parameter learning in state-space autoregressive (AR) models with structured priors. Non-conjugate priors are specified on the AR coefficients at the system level by imposing uniform or truncated normal priors on the moduli and wavelengths of the reciprocal roots of the AR characteristic polynomial. Sequential Monte Carlo algorit...
متن کاملBayesian analysis of software reliability models with reference prior
In this paper, we introduce a Bayesian analysis for non-homogeneous Poisson process in software reliability models. Posterior summaries of interest are obtained using Markov chain Monte Carlo methods. We compare the results obtained from using conjugate and reference priors. Model selection based on the prequential conditional predictive ordinates is developed.
متن کاملSequentially-Allocated Merge-Split Sampler for Conjugate and Nonconjugate Dirichlet Process Mixture Models
This paper proposes a new efficient merge-split sampler for both conjugate and nonconjugate Dirichlet process mixture (DPM) models. These Bayesian nonparametric models are usually fit usingMarkov chain Monte Carlo (MCMC) or sequential importance sampling (SIS). The latest generation of Gibbs and Gibbs-like samplers for both conjugate and nonconjugate DPM models effectively update the model para...
متن کامل